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THE  MMBIUM  FRIMCIPLE  IN  THE  THSORI 
OF  OPTIMAL  PEOC13SSS 


-USSR^ 


[Following  is  the  translation  of  an  article  by  V,  G. 
Eoltyanskiy  in  Dokladv  Akade^.ll  Hauk  SSSR  (Reports  of 
the  Academy  of  Seierices  USSR)  v  Vol  119 >  No  6,  Moscow, 

1953,  pages  1070-.1073.] 

(Presented  by  Academician  P.  S.  Aleksandrov,  19  December  1957*) 

This  paper  borders^  on  our  joir»t  work  [I]  on  the  theory,  of 
optimal  processes.  The  investi,gations  of  the  secondary  variations 
by  R.  V«  Gamkrelidze  led  to  Pontrya^in ' s  hypothesis  of  the  maximum 
principle.  Here,  conf>lete  proof  of  the  maximum  principle  as  a  neces¬ 
sary  condition  of  optimum  is  presented.  To  obtain  this  result  the 
whole  work  Is  repeated  with  .different  excitations  (variations). 

Here  is  considered  a  point,  of  motion  x  =  (x^,. . .  in  a 
n-dimensional  phase  space  X  described  by  a  laws 

.V  -  /'  (.V‘ . A".  U)  =  /'  (A,  II)..  /  -  I . It.  (1) 

where  u  is  a  control  parameter  which  varies  within  the  bo’Andries  of  a 
certain  domain  U.  The  domain  U  may  be  any  topological  space,  for 
instance,  any  arbitrary  bounded  set  of  tVie  r-dimensional  space  of  the' 
variable  u  =  .u^^);  the  functions  fi  are  assumed  to  be  contin¬ 

uously  dependent  on  two  variables  x,  u  and  continuously  differentiable 
with  respect  to  If  the  law  of  the  control  is  k'notm,  i.e. , 

if  it  is*  given  the  variable  u(t)€'0,  then  the  system  (1)  uniquely 
describes  the  law  of  motion  of  this  variable  point;  the  control  u(t) 
must  be  piece-wi.se  continuous.  Similarly  with  earlier  work  [1]  the 
problem  is  stated;  •  , 

•'Find  a  control  u(-t;.)  which  secures  the  minimal  time  for  the 
process  of  transfer  of  a  point  x  from  a  given  initial  coordinate 
x^SX",  Tbs  control  defined  above  and  the  trajectory  corresponding 


-  1  - 


to  it  are  called  optimal  control  and  optima:,  trajectory.  Let  u(t,). 
to^t^ti,  be  a  certain  control  within  the  domain  U.  We  will  choose 
the  instants  of  time  ^  where  to<'tx<. .  <t3_  and  let 

St.!  4,  1  ==  l,....Si,  i  “  be  arbitrary  non-negative  numbers 

and  be  arbitrary  points  of  the  domain  U«  We  will  denote  ty 

a  segment 

i  ll,,  ,_,1  t  ■  -1  s  ('h,.  I  i-  ■  .  -  t  jt 

if 

if  t  does  not  belong  to  any  segment  4. 

Here  £  is  a  positive  qusrktity  which  .affects  the  control  u  it). 
Subsequently  we  will  consider  E  as  a  variable  of  the  first  order  of 
tnflntesimallty;  the  quantities  of  higher  order  of  infinitesimality 
will  be  deonoted  by  a  row  of  dots  and  ignored.  We  will  note  that  the 
control  u  (t) is  derived  by  variations  of  control  u  (t)  in  the  neighbor¬ 
hood  of  the  instants  of  time  will  denote  by  x  (t)  a 

trajectory'  originated  at  the  same  coordinate  Xq  and  corresponding  to 
varied  control  u  (t)  and  we  will  consider  a  vector 

x(ii~coi)~x(ii},  (2) 

where  St  is  a  non-negative  number. 

The  linear  with  respect  to  t  part  of  the  vector  (2)  we  will  denote  by 

®  X  {^1  —  e  o/)  =--•  X  (/j)  +  £  h.V  +  .  •  . 

If  we  consider  now  all  possible  control  u(t)  resulting  from 
variation  of  the  control  u(t)  and  independently  from  this  we  vary-s  t 
giving  it  all  possible  non-negative  values,  then  the  vectors  .ax, 
originated  in  the  point  =  x(tx),  form  a  certain ' cone  with  its  vertex 
in  the  point  x^.  means  of  fairly  simple  logic  the  following  state¬ 
ment  could  be  proven;  Lemma.  “The  cone  of  attainability  is  a  convex 
cone".  Subsequently  the  theorem  presented  is  an  Intensification  of 
the  theorem  in  [1]/ which  states  that  dimension  of  a  linear  ?:ns:d.fold 
F»  corislriisted  tSiere  is  not  greater  than  n  -  1. 

Theorem  1.  "If  a  cpne  of  attainability  K  fulfills  the  whole 

space  X  of  the  variables  x^ . x",  then  the  trajectory  x(t)  and 

the  corresponding  control  'a(t),  tQ4t-;.ti  are  not  optimal,  “ 

Thus,  if  x(t)  and  u(t)  are  optimal  than  a  convex  cope  K  cannot 
occupy  the  whole  space  X  and  therefore  the  whole  cone  K  lies  in  one 
half-space  describable  by  ai  certain  hyperplane  passing  through  a 
point  Xn.  Let  -  xji)  =  0  be  the  equation  of  this  hyperplane. 

We  assume  that  the  whole  cone  K  lies  in  a  negative  half-spaoe 

ar*(x’^  -  xx-^).£iO;  we  may  change  the  algebraic  signs  of  all  coefficients  . 

a^,  if  necessary. 

If  we  assume,  just  as  in  [1], 

/vh;(o -4’y<o. 


and  assume 


ti  (n 


i  'V,  /. 
i  nil) 


Z 


then  we  receive  'tiCtj )  =  and  the  equation  of  a  negative  half-space 
will  have  a  mode  (.v''--x‘X0. 

Consequently  we  will  get  the  following  necessary  condition  for  optimum: 

tfi,,  \h)  ■ 

where  .dx  is  determined  by  equation  (3)  feJ*  eny  varied  control  u  (t)  and 
any  .rj  t  >  0 . 

This  condition  *we  may  regard  as  a  generalization  of  the  condi¬ 
tions  (6)  and  (?)  of  [1].  In  the  work  presented  in  [1]  the  variations 
of-Su(t)  also  produced  a  certain  cone  of  attainability,  which  however 
may  be  smaller  than  the  cone  K  discussed  here.  Consequently  it  is 
possible  to  have  a  case  when  quantities  3^  and  functions  'fu.(t)  would 
satisfy  all  the  condistions  imposed  in  the  [Ij  and  would  not  conform 
with  the  condition  (4).  In  other  words,  the  condition  (4)  is  more 
strict  than  the  conditions  (6)  and  (?)  of  [11. 

From  (4)  it  is  easy  to  deduce  the  follow'ing  inequality 

■f.(b)/''4'(b),  "(b))  ■  ' <1 

or, .  we  may  say  the  function  u)  =  1:;t‘tx,u)  will  have  a  non- 

negative  value  at  the  point  (x  (tq),  '{'(tq),  u  (tq)).  L.  S.  Pontryagin 
pointed  out  that  the  function  H(x, Y,  u)  plays  an  important  role  in 
the  theoi’y  of  optimal  processes.  In  particular,  the  relations  (1) 
and  (5)  of  [1]  may  be  written  in  the  form 


i  =  1, .  . 


Theorem  2.  (L.  S.  Pontryagin's  Maximiua  Principle).  "For 

optimal  trajectory  x(t)  and  control  u(t),  tgtCtXT.  it  is  necessary 
to  have  such  variable  vector  4(t),  that  the  relations  (5)  will  be 
satisfied  and  besides  with  any  t,  t^j- •C.t  Xq »  the  following  condition 
will  be  also  satisfied 

II  {;«  (t),  1)  (t),  u  (/))  =  max  H  (x  (i). (I),  »)  ^0, 

uCU 

A  sketchy  proof  of  this  theorem  follows.  We  will  choose  a  vec¬ 
tor  '^(t)  as  outlined  above.  We  assume  that  for  a  certain 'tr,  tjj<'t'X,tq, 
and  a  certain  v6U  we  will  have 

H  {x  (t),  (x),  u  (x))  <  H  (X  (x),  (j;  (xj.'o). 

We  will  vary  the  control  u(t)  on  the  single  interval  Iq  q  of  the  dura¬ 
tion  ^^tq  q  which  starts  at  ,  and  we  will  assume  a' q  q*=  4).  Then  we 
will  have;  'x(t)  =  x(t)  with  tg)i;t^4irand  ’ 


Particularly, 

A-  (t  +  e  S/j,,)  —  r  (-c  -•{-  £  S/,,3)  1/  (jr  (t),  v)  —  f  ( r  (z),  u  (x))|  s  0/3.3  +  -  •  ■ . 

from  which  it  follows,  in  accordance  with  definition  of  H,  that 

A>C>5 

Since  on  the  intejn/al'l;''^  the  control  i,u(t)  coincides  with 

u(t),  then  on  this  interval  we  will  haves 

+  o-Lei-e  , 


Thus,  x/ith  i  Sti  accordance  with  the  second  part  o; 

(5)  we  will  have:  ^  . 

"tiT  ('r'«  iO  (0)  '(/)  (/)  +  tj)^  (/) 

==  t;1-  ti.  (0  (/)  +  'i.  (/)  SxP-  (/}  ==  0. 


relation 


Therefore  on  the  whole  interval it  holds  the 
ion  •^fX 

(0  (/)  4>.  +  e  8/, ,3)  Sx"*  (t  +  £  S/,  ,)  A  :>  0 


see  (6),  and  consequently 


ill)  I {/i)  —  (‘‘.i)'!  ‘Vu  (fi)  fii  Sx’  (/,)  -i-  . .  .1  ==  Ai  -p  . , . 


Hence,  the  ^vector  4x  »  S'x  (tj),  derived  accoi'ding  to  formulas 
(2)  and  (3)  with  =  0,  what  is  permissible,  satisfies  the  condition 
“  A>0;  but  this  contradicts  the  condition  (4), 

Closely  related  to  the  Maximum  Principle  is  the  following  im¬ 
portant  property  of  the  function  H,  which  states  that  along  the  optimal 
trajectory  i.e.  the  maxirauss,  the  existance  of  which  was  proven,  is 
non-negative.  - 

Theorem  3*  IT  x(t)  is  an  optimal  ti’ajectory,  u(t)  t^-<.;t^t|  is 
an  optimal  control,  and  '{  (t)  is  a  variable  vector,  the  existence  oT 
which  is  confirmed  in  theorem  2,  then  the  function  H  maintains  along 
^(t),  *f  (t),  u(t)  a  constant  non-negative  valuer 


H  ^  (/),  «  (/))[=  const. 


This  relation  is  proven  first  for  eveiy  segment  of  piece-wise 
continuous  control  uCt).  Subsequently  it  is  proven  that  at  every  point 
of  discontinuity  of  the  control  u(t)  the  function  K  is  not  changing 
its  value. 

These  results  are  obtained  in  a  seminar  on  the  theory  of  oscil¬ 
lations  and  automatic  control  personally  conducted  by  L.  S.  Pontryagin. 
L.  S.  Pontryagin  pointed  out  to  me  one  possible  simplification  in  my 
derivation  of  the  Maximum  Principle  proof.  Owing  to  that  suggestion 
my  proof,  as  it  is  now,  holds  for  any  arbitrary  topological  space  U; 
when  my  original  variant  had  a  redundant  construction  not  used  for  any 
useful  purpose  yet  limiting  the  proof  to  the  case  of  U  being  bounded 
domain  of  the  vector  space  with  piece-wise—smooth  frontier  and  convex 
interior  angles  at  the  joints. 
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